AITopics | formal method

Collaborating Authors

formal method

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

STL: Still Tricky Logic (for System Validation, Even When Showing Y our Work) Isabelle Hurley

Neural Information Processing SystemsFeb-18-2026, 08:03:22 GMT

Previous work showed that despite claims of interpretability, humans are unable to use formal specifications presented in a variety of ways to validate even simple robot behaviors.

logic & formal reasoning, machine learning, specification, (20 more...)

Neural Information Processing Systems

Country:

North America > United States > California > Los Angeles County > Los Angeles (0.14)
North America > United States > Massachusetts > Middlesex County > Lexington (0.04)
North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
(4 more...)

Genre:

Research Report > New Finding (1.00)
Research Report > Experimental Study > Negative Result (0.67)

Industry:

Education > Educational Setting (1.00)
Government > Military (0.93)
Government > Regional Government > North America Government > United States Government (0.93)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language (0.93)
Information Technology > Artificial Intelligence > Representation & Reasoning > Logic & Formal Reasoning (0.90)

Add feedback

Position: Vibe Coding Needs Vibe Reasoning: Improving Vibe Coding with Formal Verification

Mitchell, Jacqueline, Shaaban, Yasser

arXiv.org Artificial IntelligenceNov-4-2025

``Vibe coding'' -- the practice of developing software through iteratively conversing with a large language model (LLM) -- has exploded in popularity within the last year. However, developers report key limitations including the accumulation of technical debt, security issues, and code churn to achieve satisfactory results. We argue that these pitfalls result from LLMs' inability to reconcile accumulating human-imposed constraints during vibe coding, with developers inadvertently failing to resolve contradictions because LLMs prioritize user commands over code consistency. Given LLMs' receptiveness to verification-based feedback, we argue that formal methods can mitigate these pitfalls, making vibe coding more reliable. However, we posit that integrating formal methods must transcend existing approaches that combine formal methods and LLMs. We advocate for a side-car system throughout the vibe coding process which: (1) \emph{Autoformalizes} specifications (2) Validates against targets, (3) Delivers \emph{actionable} feedback to the LLM, and (4) Allows intuitive developer influence on specifications.

large language model, natural language, specification, (14 more...)

arXiv.org Artificial Intelligence

doi: 10.1145/3759425.3763390

2511.00202

Country: North America > United States > California > San Francisco County > San Francisco (0.14)

Genre: Research Report (0.50)

Industry: Information Technology > Security & Privacy (0.34)

Technology: Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)

Add feedback

STL: Still Tricky Logic (for System Validation, Even When Showing Y our Work) Isabelle Hurley

Neural Information Processing SystemsOct-10-2025, 18:14:53 GMT

Previous work showed that despite claims of interpretability, humans are unable to use formal specifications presented in a variety of ways to validate even simple robot behaviors.

experiment, specification, trajectory, (16 more...)

Neural Information Processing Systems

Country:

North America > United States > California > Los Angeles County > Los Angeles (0.14)
North America > United States > Massachusetts > Middlesex County > Lexington (0.04)
North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
(4 more...)

Genre:

Research Report > New Finding (1.00)
Research Report > Experimental Study > Negative Result (0.67)

Industry:

Education > Educational Setting (1.00)
Government > Military (0.93)
Government > Regional Government > North America Government > United States Government (0.93)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language (0.93)
Information Technology > Artificial Intelligence > Representation & Reasoning > Logic & Formal Reasoning (0.90)

Add feedback

AgentGuard: Runtime Verification of AI Agents

Koohestani, Roham

arXiv.org Artificial IntelligenceSep-30-2025

The rapid evolution to autonomous, agentic AI systems introduces significant risks due to their inherent unpredictability and emergent behaviors; this also renders traditional verification methods inadequate and necessitates a shift towards probabilistic guarantees where the question is no longer if a system will fail, but the probability of its failure within given constraints. This paper presents AgentGuard, a framework for runtime verification of Agentic AI systems that provides continuous, quantitative assurance through a new paradigm called Dynamic Probabilistic Assurance. AgentGuard operates as an inspection layer that observes an agent's raw I/O and abstracts it into formal events corresponding to transitions in a state model. It then uses online learning to dynamically build and update a Markov Decision Process (MDP) that formally models the agent's emergent behavior. Using probabilistic model checking, the framework then verifies quantitative properties in real-time.

agent, artificial intelligence, machine learning, (14 more...)

arXiv.org Artificial Intelligence

2509.23864

Country: Europe (0.68)

Genre: Research Report (0.64)

Industry: Education (0.35)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.67)

Add feedback

Bridging Language Models and Formal Methods for Intent-Driven Optical Network Design

Bekri, Anis, Abane, Amar, Battou, Abdella, Bensalem, Saddek

arXiv.org Artificial IntelligenceSep-30-2025

Abstract--Intent-Based Networking (IBN) aims to simplify network management by enabling users to specify high-level goals that drive automated network design and configuration. However, translating informal natural-language intents into formally correct optical network topologies remains challenging due to inherent ambiguity and lack of rigor in Large Language Models (LLMs). T o address this, we propose a novel hybrid pipeline that integrates LLM-based intent parsing, formal methods, and Optical Retrieval-Augmented Generation (RAG). By enriching design decisions with domain-specific optical standards and systematically incorporating symbolic reasoning and verification techniques, our pipeline generates explainable, verifiable, and trustworthy optical network designs. Intent-Based Networking (IBN) simplifies network management by allowing users to express high-level objectives--such as connectivity, performance, or security--without specifying implementation details [1], [2]. Standardization bodies like TM Forum and the Internet Engineering Task Force define intent as a declarative statement of desired outcomes, delegating the detailed configuration and implementation tasks to automated systems. By abstracting away low-level complexities, IBN significantly reduces operational overhead, human error, and management complexity [2]. Existing research predominantly explores intent translation into configurations or incremental topology adjustments [3], [4], but largely overlooks the initial phase of comprehensive network design, particularly for optical networks. Poor initial design decisions can lead to significant performance degradation or expensive reconfigurations throughout the operational lifecycle [5], [6].

large language model, machine learning, specification, (20 more...)

arXiv.org Artificial Intelligence

2509.22834

Country: Europe > France (0.14)

Genre: Research Report (0.82)

Industry: Telecommunications > Networks (1.00)

Technology:

Information Technology > Communications > Networks (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.48)

Add feedback

Revisiting Formal Methods for Autonomous Robots: A Structured Survey

Azaiez, Atef, Anisi, David A., Farrell, Marie, Luckcuck, Matt

arXiv.org Artificial IntelligenceSep-26-2025

This paper presents the initial results from our structured literature review on applications of Formal Methods (FM) to Robotic Autonomous Systems (RAS). We describe our structured survey methodology; including database selection and associated search strings, search filters and collaborative review of identified papers. We categorise and enumerate the FM approaches and formalisms that have been used for specification and verification of RAS. We investigate FM in the context of sub-symbolic AI-enabled RAS and examine the evolution of how FM is used over time in this field. This work complements a pre-existing survey in this area and we examine how this research area has matured over time. Specifically, our survey demonstrates that some trends have persisted as observed in a previous survey. Additionally, it recognized new trends that were not considered previously including a noticeable increase in adopting Formal Synthesis approaches as well as Probabilistic Verification Techniques.

artificial intelligence, logic & formal reasoning, logic programming, (17 more...)

arXiv.org Artificial Intelligence

doi: 10.1007/978-3-032-01486-3_26

2509.20488

Country:

Europe > Italy (0.28)
Europe > United Kingdom > England (0.28)

Genre:

Overview (1.00)
Research Report (0.84)

Industry:

Information Technology (0.68)
Transportation > Infrastructure & Services (0.46)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Logic & Formal Reasoning (1.00)

Add feedback

It Takes a Village: Bridging the Gaps between Current and Formal Specifications for Protocols

Communications of the ACMJul-23-2025, 14:35:37 GMT

Network protocols are fundamental to the functioning of modern society, as more and more services are provided online, including critical infrastructure. Historically, these services have mostly been implemented with wired networks that connect conventional computers. But with the emergence of wireless and cellular technologies, their reach has extended to Internet of Things (IoT) devices and sensors, robotic systems, and autonomous vehicles. The next generation of cellular technologies, often referred to as NextG, will further expand the set of services that can be delivered over a network, including real-time remote healthcare, industrial automation, precision agriculture, smart infrastructure, holographic communication, space-based communication, and more. Experience shows that formal specification can provide significant practical benefits for network protocol development by enhancing clarity, trustworthiness, and productivity. Adoption of formal methods has been hindered not by lack of scalability but by cultural gaps that result in tools and methods that do not align with engineering practice.

artificial intelligence, logic & formal reasoning, logic programming, (9 more...)

Communications of the ACM

Industry:

Health & Medicine (0.57)
Food & Agriculture > Agriculture (0.57)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Logic & Formal Reasoning (1.00)
Information Technology > Communications > Networks (0.77)

Add feedback

Towards provable probabilistic safety for scalable embodied AI systems

He, Linxuan, Jia, Qing-Shan, Li, Ang, Sang, Hongyan, Wang, Ling, Lu, Jiwen, Zhang, Tao, Zhou, Jie, Zhang, Yi, Wang, Yisen, Wei, Peng, Wang, Zhongyuan, Liu, Henry X., Feng, Shuo

arXiv.org Artificial IntelligenceJul-23-2025

Embodied AI systems, comprising AI models and physical plants, are increasingly prevalent across various applications. Due to the rarity of system failures, ensuring their safety in complex operating environments remains a major challenge, which severely hinders their large-scale deployment in safety-critical domains, such as autonomous vehicles, medical devices, and robotics. While achieving provable deterministic safety--verifying system safety across all possible scenarios--remains theoretically ideal, the rarity and complexity of corner cases make this approach impractical for scalable embodied AI systems. Instead, empirical safety evaluation is employed as an alternative, but the absence of provable guarantees imposes significant limitations. To address these issues, we argue for a paradigm shift to provable probabilistic safety that integrates provable guarantees with progressive achievement toward a probabilistic safety boundary on overall system performance. The new paradigm better leverages statistical methods to enhance feasibility and scalability, and a well-defined probabilistic safety boundary enables embodied AI systems to be deployed at scale. In this Perspective, we outline a roadmap for provable probabilistic safety, along with corresponding challenges and potential solutions. By bridging the gap between theoretical safety assurance and practical deployment, this Perspective offers a pathway toward safer, large-scale adoption of embodied AI systems in safety-critical applications.

ai system, data mining, machine learning, (19 more...)

arXiv.org Artificial Intelligence

2506.05171

Country:

North America > United States (0.93)
Europe (0.67)

Genre: Research Report (0.84)

Industry:

Information Technology > Security & Privacy (1.00)
Transportation > Air (0.93)
Health & Medicine (0.68)
(2 more...)

Technology:

Information Technology > Data Science > Data Mining (1.00)
Information Technology > Artificial Intelligence > Robots > Autonomous Vehicles (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
(2 more...)

Add feedback

Systems Correctness Practices at Amazon Web Services

Communications of the ACMMay-29-2025, 13:30:41 GMT

Amazon Web Services (AWS) strives to deliver reliable services that customers can trust completely. This requires maintaining the highest standards of security, durability, integrity, and availability--with systems correctness serving as the cornerstone for achieving these priorities. An April 2015 article published in Communications of the ACM, titled "How Amazon Web Services Uses Formal Methods," highlighted the approach for ensuring the correctness of critical services that have since become among the most widely used by AWS customers.21 Central to this approach was TLA,14 a formal specification language developed by Leslie Lamport. Our experience at AWS with TLA revealed two significant advantages of applying formal methods in practice.

artificial intelligence, correctness, logic & formal reasoning, (6 more...)

Communications of the ACM

Industry: Information Technology > Services (0.86)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Logic & Formal Reasoning (0.92)
Information Technology > Communications > Web (0.86)

Add feedback

Reasoning Under Threat: Symbolic and Neural Techniques for Cybersecurity Verification

Veronica, Sarah

arXiv.org Artificial IntelligenceMar-27-2025

Cybersecurity demands rigorous and scalable techniques to ensure system correctness, robustness, and resilience against evolving threats. Automated reasoning, encompassing formal logic, theorem proving, model checking, and symbolic analysis, provides a foundational framework for verifying security properties across diverse domains such as access control, protocol design, vulnerability detection, and adversarial modeling. This survey presents a comprehensive overview of the role of automated reasoning in cybersecurity, analyzing how logical systems, including temporal, deontic, and epistemic logics are employed to formalize and verify security guarantees. We examine SOTA tools and frameworks, explore integrations with AI for neural-symbolic reasoning, and highlight critical research gaps, particularly in scalability, compositionality, and multi-layered security modeling. The paper concludes with a set of well-grounded future research directions, aiming to foster the development of secure systems through formal, automated, and explainable reasoning techniques.

artificial intelligence, jin song dong, logic & formal reasoning, (17 more...)

arXiv.org Artificial Intelligence

2503.22755

Country:

Oceania > Australia > Queensland > Brisbane (0.04)
Europe > France > Île-de-France > Paris > Paris (0.04)
Asia > Singapore (0.04)
(17 more...)

Genre: Overview (1.00)

Industry:

Information Technology > Security & Privacy (1.00)
Government > Military > Cyberwarfare (1.00)

Technology:

Information Technology > Security & Privacy (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Logic & Formal Reasoning (1.00)

Add feedback